SUZUKI GROUPS AS EXPANDERS

EMMANUEL BREUILLARD, BEN GREEN, AND TERENCE TAO

Abstract. We show that pairs of generators for the family Sz(q) of
Suzuki groups may be selected so that the corresponding Cayley graphs
are expanders. By combining this with several deep works of Kassabov,
Lubotzky and Nikolov, this establishes that the family of all non-abelian
finite simple groups can be made into expanders in a uniform fashion.



1. Introduction 



Let X be a graph and let $\varepsilon > 0$ be a real number. We say that X is
an $\varepsilon$-expander if, for all sets A consisting of at most half the vertices of X,
we have $|\partial A| \geq \varepsilon |A|$. Here, $\partial A$ refers to the boundary of A, that is to say
those vertices that are joined to a vertex in A but do not themselves lie in
A. There is a huge literature on expander graphs, and we refer the reader
to [13] for a comprehensive discussion. Much attention has been devoted to
the particular case of Cayley graphs Cay(G, S). Given a finite group G and
a symmetric generating set S, one defines Cay(G, S) to be the graph with
vertex set G in which x and y are joined by an edge if and only if x = ys for
some $s \in S$. Note that if $|S| = k$ then the Cayley graph is k-regular.

The following remarkable result was announced in [17], based on several
earlier works of subsets of the authors of that paper.

Theorem 1.1 (Every* non-abelian finite simple group is an expander).
There exist $k \in \mathbb{N}$ and $\varepsilon > 0$ such that, for every* non-abelian finite
simple group G, one may select a symmetric set S of k generators for which
Cay(G, S) is an $\varepsilon$-expander.

The asterisk means that this is true with the possible exception of a single
family of simple groups, the Suzuki groups Sz(q). We will recall the definition
of these in Section 3. We remark that the main reason why the techniques
in [17] do not extend to the Suzuki case is that those methods rely on G
containing an embedded copy of $\mathrm{SL}_2(q')$ for some large $q'$; however this is
not so for Suzuki groups since $|\mathrm{SL}_2(q')|$ is always divisible by 3, whereas
$|\mathrm{Sz}(q)|$ is not.

Our main aim in this paper is to remove the lacuna in Theorem 1.1.

Theorem 1.2 (Suzuki groups are expanders). There exists $\varepsilon > 0$ such that,
for every Suzuki group G = Sz(q), one may select a generating pair a, b
in G such that, setting $S := \{a^{\pm 1}, b^{\pm 1}\}$, the Cayley graph Cay(G, S) is an
$\varepsilon$-expander. Thus Theorem 1.1 is true without any exception.

Remarks. Our method is probabilistic and thus does not provide an
explicit generating set. In fact, we shall prove that a random pair of elements
a, b will generate G and have the above expansion property with probability
going to 1 as $q \to \infty$.

It is remarked in [17] that careful computation ought to yield k = 1000
and $\varepsilon = 10^{-10}$ as acceptable values in Theorem 1.1. As it stands our method,
though it gives the rather superior value k = 4, does not give any explicit
value of $\varepsilon$. However one could in principle replace all of the quantitative
algebraic geometry arguments in Appendix A of [3] by effective arguments,
rather than using ultrafilters as we did there. Furthermore, this would only
need to be done in the case $G = \mathrm{Sp}_4$ of the main theorem in [3].

Finally, let us note that the tendency to define group expansion in terms 
of graphs is a matter of custom, designed to draw attention to the much 
wider world of graph expanders. However, for our purposes a completely 
equivalent definition involving only the group is as follows: G is an expander 
with generating set S if and only if $|AS \setminus A| \geq \varepsilon |A|$ for all sets $A \subseteq G$ with
$|A| \leq |G|/2$. Perhaps even more naturally, G is an expander with generating
set S if and only if $|AS'| \geq (1 + \varepsilon)|A|$ for all sets A with $|A| \leq |G|/2$, where
$S' := S \cup \{\mathrm{id}_G\}$.
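The two group-theoretic formulations of expansion above can be compared by brute force on a very small example. The following is a minimal sketch only: the symmetric group S_3 is used as a toy stand-in for Sz(q), and the names `compose`, `inverse`, `product_set` and `expansion_constants` are illustrative helpers, not notation from the paper.

```python
from fractions import Fraction
from itertools import combinations, permutations


def compose(p, q):
    """Return the permutation x -> p[q[x]] (apply q first, then p)."""
    return tuple(p[i] for i in q)


def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, j in enumerate(p):
        inv[j] = i
    return tuple(inv)


# Toy example (an assumption for illustration): G = S_3 rather than Sz(q).
G = list(permutations(range(3)))
IDENTITY = tuple(range(3))
a, b = (1, 2, 0), (1, 0, 2)            # a 3-cycle and a transposition
S = {a, inverse(a), b, inverse(b)}     # symmetric generating set


def product_set(A, T):
    """Return the product set AT = {xs : x in A, s in T}."""
    return {compose(x, s) for x in A for s in T}


def expansion_constants():
    """Largest admissible epsilon in the boundary form and in the growth form."""
    boundary_ratios, growth_ratios = [], []
    for size in range(1, len(G) // 2 + 1):
        for subset in combinations(G, size):
            A = set(subset)
            boundary = product_set(A, S) - A           # AS \ A
            grown = product_set(A, S | {IDENTITY})     # AS'
            boundary_ratios.append(Fraction(len(boundary), len(A)))
            growth_ratios.append(Fraction(len(grown), len(A)) - 1)
    return min(boundary_ratios), min(growth_ratios)
```

Since AS' is the disjoint union of A and AS \ A, the two returned constants agree exactly; exact rational arithmetic is used so that this can be checked without rounding issues.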

Notation. We use the asymptotic notation $X = O(Y)$, $X \ll Y$, or
$Y \gg X$ to denote the estimate $|X| \leq CY$ for some absolute constant C. If
we need C to depend on additional parameters then we will indicate this
by subscripts; thus for instance $X \ll_l Y$ denotes the estimate $|X| \leq C_l Y$
for some $C_l$ depending only on l. We use $o_{q \to \infty}(1)$ to denote a quantity
that is bounded in magnitude by c(q) for some quantity c(q) depending on
q that goes to zero as $q \to \infty$. If A is a non-empty finite set, we use $\mathbb{E}_{x \in A}$
as shorthand for $\frac{1}{|A|} \sum_{x \in A}$, where $|A|$ denotes the cardinality of A.

2. An overview of the argument

The basic scheme of the argument is the same as that used by Bourgain 
and Gamburd [2] to show that Zariski-dense (or random) pairs of generators 
give expanders in SL2(Fp). The argument requires three main ingredients. 
The first is a "Helfgott-type result", that is to say a statement about the 
structure of "approximate subgroups" of G, of a type first obtained in [12].
The notions of approximate subgroup and of "control" (which features
below) were also used in the context of Bourgain and Gamburd's methods in

Theorem 2.1. Suppose that $A \subseteq \mathrm{Sz}(q)$ is a K-approximate group that
generates Sz(q). Then A is $K^{O(1)}$-controlled by either the trivial group {id}
or the whole group Sz(q).

Proof. This follows immediately from [21, Theorem 4], since Sz(q) is a
simple group of Lie type. It may also be deduced from [3], using the fact
that Sz(q) is a "sufficiently Zariski-dense" subgroup of the Chevalley group
$\mathrm{Sp}_4(\overline{\mathbb{F}_2})$ of 4 × 4 symplectic matrices over $\overline{\mathbb{F}_2}$, that is to say the smallest
degree of any nontrivial polynomial on $\mathrm{Sp}_4(\overline{\mathbb{F}_2})$ that vanishes on Sz(q) tends
to infinity with q. That this is so follows from the work of Jones [15],
reproduced as Lemma 3.3 in the present paper; it was used in a related
context by Larsen [19]. □



The next ingredient is an assertion that Suzuki groups are highly
quasirandom in the sense of Gowers [9].

Theorem 2.2 (Quasirandomness). The smallest dimension of a nontrivial
irreducible representation of Sz(q) is bounded below by $cq^{3/2}$ for some
absolute constant $c > 0$.

Proof. This follows from the paper of Landazuri and Seitz [18]. The
exponent 3/2 is not essential for our purposes; any lower bound which was
polynomial in q would suffice. □

The third ingredient is not in the previous literature and is therefore the 
beef of our paper: it asserts that random walks on the Suzuki group do not 
concentrate in a subgroup. 

Theorem 2.3 (Non-concentration estimate). For some choice of $a, b \in \mathrm{Sz}(q)$
and for some constants C and $\delta > 0$, we have

\[ \sup_{H < G} \mu_{a,b}^{(n)}(H) < q^{-\delta} \tag{2.1} \]

for all $n \geq C \log q$, where the supremum ranges over all proper subgroups H
of G, and where $\mu_{a,b}$ is the probability measure assigning weight 1/4 to each
of the four points $a, a^{-1}, b, b^{-1}$, and $\mu_{a,b}^{(n)}$ is the n-fold convolution
of $\mu_{a,b}$ with itself.

In fact we will prove that this statement holds for a proportion $1 - o_{q \to \infty}(1)$
of all pairs (a, b) in Sz(q) × Sz(q).

The Bourgain-Gamburd argument from [2] yields Theorem 1.2 as a
consequence of Theorems 2.1, 2.2 and 2.3; see [10] or [4] for further discussion.
Thus, the only remaining task is to establish Theorem 2.3.

There are several steps necessary to do this, but we take advantage of the
fact that maximal proper subgroups of Sz(q) are either 3-step solvable or else
conjugates of $\mathrm{Sz}(q_0)$ for some $q_0 = q^{1/r}$, $r \in \mathbb{N}$, $r \geq 3$ (note that $q = 2^{2n+1}$,
so we could not have r = 2; this point will be helpful later on); see Lemma
3.2. The solvability is particularly helpful and allows us to avoid the theory
of random matrix products, important in more recent applications of the
Bourgain-Gamburd method, by following a simpler argument very similar
to that used in [2]. Another key ingredient will be that almost all pairs (a, b)
in Sz(q) satisfy no relation of length up to $\kappa \log q$, for some $\kappa > 0$, a fact
first proved in [8] (see also Appendix B below).

3. Basic facts about Suzuki groups 

There are various conceptually enlightening ways to define the Suzuki
groups: we refer the reader to [23] for some of them. For our purposes
it is more convenient to proceed using a quite explicit presentation of these
groups using 4 × 4 matrices over fields of characteristic two (although we will
not, in this paper, really need to know the detailed form of this presentation).
Such a presentation was given in Suzuki's original paper [23]. A closely
related parametrisation was used in a paper of Jones [15]. Since we will require
results from Jones' paper, we use his particular parametrisation below.

Here, and for the rest of the paper, we will set $q := 2^{2n+1}$ and consider
the finite field $\mathbb{F}_q$. Set $\theta := 2^{n+1}$. Then, for any $x \in \mathbb{F}_q$, we have
$(x^\theta)^\theta = x^2$, that is to say, the map $x \mapsto x^\theta$ acts as a "square root" of the
Frobenius map $x \mapsto x^2$. As $\mathbb{F}_q$ has characteristic 2, the map $x \mapsto x^\theta$ is of
course an automorphism.
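The identity $(x^\theta)^\theta = x^2$ is easy to confirm numerically in the two smallest cases q = 8 and q = 32. The sketch below is an illustration under stated assumptions: field elements are encoded as bit masks of polynomials over F_2, and the reduction polynomials t^3 + t + 1 and t^5 + t^2 + 1 are choices made here, not taken from the paper. The helper names `gf_mul`, `gf_pow` and `check_theta` are hypothetical.

```python
def gf_mul(x, y, modulus, degree):
    """Multiply x and y in F_2[t]/(modulus); elements are bit masks below 2**degree."""
    result = 0
    while y:
        if y & 1:
            result ^= x
        y >>= 1
        x <<= 1
        if (x >> degree) & 1:      # reduce as soon as the degree overflows
            x ^= modulus
    return result


def gf_pow(x, exponent, modulus, degree):
    """Raise x to a non-negative integer power by repeated squaring."""
    result = 1
    while exponent:
        if exponent & 1:
            result = gf_mul(result, x, modulus, degree)
        x = gf_mul(x, x, modulus, degree)
        exponent >>= 1
    return result


# Assumed irreducible polynomials of degree 2n+1 over F_2 (bit-mask encoding):
# n = 1: t^3 + t + 1, n = 2: t^5 + t^2 + 1.
MODULI = {1: 0b1011, 2: 0b100101}


def check_theta(n):
    """Check that x -> x^theta is a bijection of F_q whose square is x -> x^2."""
    degree = 2 * n + 1
    modulus = MODULI[n]
    q, theta = 2 ** degree, 2 ** (n + 1)
    twisted = [gf_pow(x, theta, modulus, degree) for x in range(q)]
    squares_to_frobenius = all(
        gf_pow(twisted[x], theta, modulus, degree) == gf_mul(x, x, modulus, degree)
        for x in range(q)
    )
    return squares_to_frobenius and len(set(twisted)) == q
```

The check works because $\theta^2 = 2^{2n+2} = 2q$ and $x^q = x$ for every x in the field.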



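Definition 3.1 (Suzuki group). Suppose that $a, b, \alpha, \beta \in \mathbb{F}_q$ and $c, \gamma \in \mathbb{F}_q^\times$.
Define 4 × 4 matrices over $\mathbb{F}_q$ by

\[ u(a, b, \alpha, \beta) := \begin{pmatrix} 1 & 0 & 0 & 0 \\ \alpha & 1 & 0 & 0 \\ a\alpha + \beta & a & 1 & 0 \\ \alpha^2 a + \alpha\beta + b & \beta & \alpha & 1 \end{pmatrix}, \]

\[ d(c, \gamma) := \begin{pmatrix} \gamma c & 0 & 0 & 0 \\ 0 & \gamma & 0 & 0 \\ 0 & 0 & \gamma^{-1} & 0 \\ 0 & 0 & 0 & \gamma^{-1} c^{-1} \end{pmatrix}, \qquad T := \begin{pmatrix} 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \end{pmatrix}. \]

Then, setting $U(\alpha, \beta) := u(\alpha^\theta, \beta^\theta, \alpha, \beta)$ and $D(\gamma) := d(\gamma^\theta, \gamma)$, we define

\[ \mathrm{Sz}(q) := \{ U(\alpha, \beta) D(\gamma) T U(\alpha', \beta') : \alpha, \alpha', \beta, \beta' \in \mathbb{F}_q, \gamma \in \mathbb{F}_q^\times \} \cup \{ U(\alpha, \beta) D(\gamma) : \alpha, \beta \in \mathbb{F}_q, \gamma \in \mathbb{F}_q^\times \}. \]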

Throughout the paper we write B for the subgroup consisting of the
products $U(\alpha, \beta)D(\gamma)$. The reader may care to check that the
parametrisations given in the definition above are unique. Thus $|B| = q^2(q - 1)$
and $|\mathrm{Sz}(q)| = q^2(q^2 + 1)(q - 1) \sim q^5$. Moreover Sz(q) is a subgroup
of $\mathrm{Sp}_4(q) := \{A \in \mathrm{SL}_4(q) : A^t T A = T\}$ and the matrices of the form
$u(a, b, \alpha, \beta)d(c, \gamma)$ parametrise a Borel subgroup $B_0$ of $\mathrm{Sp}_4(q)$.

Let us now detail several lemmas concerning these groups. First, we need
the following result concerning their subgroup structure.

Lemma 3.2 (Subgroups of Sz(q)). Every proper subgroup H < Sz(q) is
either a conjugate of $\mathrm{Sz}(q_0)$ for some subfield $\mathbb{F}_{q_0} \subset \mathbb{F}_q$ or else is a 3-step
solvable group.

Proof. This is a consequence of [24, Theorem 4.1]. The notation used there
hails from the Atlas of finite groups and is not necessarily standard. Note,
for example, that in parts (iii) and (iv) of [24, Theorem 4.1] one sees the
notation $C_n : 4$, which refers to a semidirect product of the cyclic group
Z/nZ by the elementary abelian group Z/2Z × Z/2Z. □

The next lemma asserts that Sz(q) is not contained in any low-complexity
subvariety of $\mathrm{Sp}_4(\overline{\mathbb{F}_2})$; it is needed in order to give the second proof of
Theorem 2.1 using the results in [3].

Lemma 3.3 (Sz(q) is sufficiently Zariski dense). The Suzuki group Sz(q) is
not contained in any proper algebraic subgroup of $\mathrm{Sp}_4(\overline{\mathbb{F}_2})$ of complexity $M_q$,
where $M_q \to \infty$ as $q \to \infty$.

Proof. See Appendix A. In fact we can take $M_q \sim q^{1/2}$, although the exact
rate is not of importance for our argument. □

Remark. For a full discussion of the notion of complexity of an algebraic
variety, see our earlier paper [3]. In the context of this lemma, it simply
means that the smallest degree of a polynomial which vanishes on Sz(q) but
not on $\mathrm{Sp}_4(\overline{\mathbb{F}_2})$ tends to infinity as q does.

We will also require some less standard facts about the Suzuki groups. 
The first is a result of G. Jones, giving a lower bound on the shortest word 
that vanishes identically on Sz(q).

Lemma 3.4 (No short word laws in the Suzuki group). Suppose that w is
some word in the free group $F_2$, such that w(a, b) = id for all $a, b \in \mathrm{Sz}(q) \setminus B$.
Then w has length at least $c\sqrt{q}$ for some absolute constant c > 0.

Proof. This follows from the proof of [15, Lemma 5]. For the convenience
of the reader we sketch the argument in Appendix A. □

Finally, we will require the following result of Gamburd, Hoory,
Shahshahani, Shalev and Virág [8].

Lemma 3.5 (Large girth). [8] Let G = Sz(q). There is an absolute constant
$\kappa > 0$ such that, with probability $1 - o_{q \to \infty}(1)$, a randomly chosen pair
$a, b \in G$ will be such that $w(a, b) \neq \mathrm{id}$ for all nontrivial words w in the free
group $F_2$ with length at most $\kappa \log q$.

Here, recall, $o_{q \to \infty}(1)$ denotes a quantity that goes to zero as $q \to \infty$.
Most of the details in the paper [8] are concerned with the case of Chevalley
groups, and mention of twisted groups such as the Suzuki groups under
consideration here is confined to a few lines. More importantly, the treatment
of these groups depends on a paper of Hrushovski [14] which uses methods of
mathematical logic. If we were to simply quote [8], then, our main theorem
would be genuinely ineffective^1. For these reasons we give a self-contained
and rather elementary proof of Lemma 3.5 in Appendix B, leading to a value
of $\kappa$ which is half that obtained in [8]. Although it is not strictly necessary
for our paper we then indicate a more complicated, but still elementary,
proof of exactly the bound in [8].

4. The nonconcentration estimate 

The aim of this section is to prove Theorem 2.3. We may assume that q
is sufficiently large, since the claim is trivial for bounded q by selecting a, b
to be an arbitrary pair of generators of Sz(q). (It has long been known that
such generators exist: see [22].)

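Note that it will be enough to prove this estimate for n of the order of
log q (up to multiplicative constants). Indeed, if $\mu_{a,b}^{(2n)}(H) < q^{-\delta}$ for some
such n, then

\[ \mu_{a,b}^{(n+m)}(H) = \sum_x \mu_{a,b}^{(m)}(x)\, \mu_{a,b}^{(n)}(x^{-1}H) \leq \big( \mu_{a,b}^{(2n)}(H) \big)^{1/2} < q^{-\delta/2} \]

for all $m \geq 0$, since $\mu_{a,b}^{(2n)}(H) \geq \mu_{a,b}^{(n)}(x^{-1}H)\, \mu_{a,b}^{(n)}(Hx) = \mu_{a,b}^{(n)}(x^{-1}H)^2$.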
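The monotonicity argument just given uses only that H is a subgroup and that the step distribution is symmetric, so it can be sanity-checked on any small group. The following sketch does this for the symmetric group S_4 with H a point stabiliser; the group, the generators and the function names are assumptions chosen for illustration and are unrelated to Sz(q).

```python
from fractions import Fraction
from itertools import permutations


def compose(p, q):
    """Return the permutation x -> p[q[x]]."""
    return tuple(p[i] for i in q)


def inverse(p):
    inv = [0] * len(p)
    for i, j in enumerate(p):
        inv[j] = i
    return tuple(inv)


G = list(permutations(range(4)))            # toy group S_4
IDENTITY = tuple(range(4))
a, b = (1, 2, 3, 0), (1, 0, 2, 3)           # a 4-cycle and a transposition
STEPS = [a, inverse(a), b, inverse(b)]      # weight 1/4 each (b is its own inverse)
H = [g for g in G if g[3] == 3]             # proper subgroup: stabiliser of the point 3


def walk_distributions(max_steps):
    """Return [mu^(0), mu^(1), ..., mu^(max_steps)] as exact dictionaries."""
    dist = {g: Fraction(0) for g in G}
    dist[IDENTITY] = Fraction(1)
    history = [dist]
    for _ in range(max_steps):
        new = {g: Fraction(0) for g in G}
        for g, mass in dist.items():
            for s in STEPS:
                new[compose(g, s)] += mass / 4
        dist = new
        history.append(dist)
    return history


def mass_of_H(dist):
    return sum(dist[h] for h in H)


def inequality_holds(max_n, max_m):
    """Check mu^(n+m)(H)^2 <= mu^(2n)(H) for 1 <= n <= max_n, 0 <= m <= max_m."""
    history = walk_distributions(max(2 * max_n, max_n + max_m))
    return all(
        mass_of_H(history[n + m]) ** 2 <= mass_of_H(history[2 * n])
        for n in range(1, max_n + 1)
        for m in range(max_m + 1)
    )
```

The inequality is tested in squared form so that everything stays in exact rational arithmetic.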

We will in fact show that (2.1) holds for randomly selected a and b, with
probability $1 - o_{q \to \infty}(1)$ as $q \to \infty$. Note that this implies that randomly
selected a and b do generate Sz(q) if q is sufficiently large. By Lemma
3.2 this task divides into two subtasks: we must handle the algebraic case in
which H is 3-step solvable and the arithmetic case in which $H = x^{-1} \mathrm{Sz}(q_0) x$
for some $\mathbb{F}_{q_0} \subset \mathbb{F}_q$, $x \in G$.

The algebraic case. Our argument here is almost identical to that in [2]. 

Let $F_k$ be the free group on k letters, and write $W_k(L)$ for the ball of
radius L about id in the word metric on $F_k$. If G is a group, then for each
l = 0, 1, 2, 3, ... we define the l-fold commutator maps $\psi_l : G^{2^l} \to G$ by

\[ \psi_0(g) = g, \]
\[ \psi_1(g_0, g_1) = [g_0, g_1], \]
\[ \psi_2(g_{00}, g_{01}, g_{10}, g_{11}) = [[g_{00}, g_{01}], [g_{10}, g_{11}]], \]
\[ \psi_3((g_\omega)_{\omega \in \{0,1\}^3}) = [[[g_{000}, g_{001}], [g_{010}, g_{011}]], [[g_{100}, g_{101}], [g_{110}, g_{111}]]] \]

and so on. For definiteness (though it scarcely matters) we use the group
theorists' definition of commutator, namely $[x, y] = x^{-1} y^{-1} x y$.
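To see the commutator maps in action, one can evaluate them in a small solvable group. The sketch below uses S_4, which is 3-step solvable, as a stand-in for the solvable subgroups of Sz(q); this choice and the helper names `psi` and `psi_image` are assumptions made for illustration.

```python
from itertools import permutations


def compose(p, q):
    """Return the permutation x -> p[q[x]]."""
    return tuple(p[i] for i in q)


def inverse(p):
    inv = [0] * len(p)
    for i, j in enumerate(p):
        inv[j] = i
    return tuple(inv)


def commutator(x, y):
    """Group theorists' commutator [x, y] = x^{-1} y^{-1} x y."""
    return compose(compose(inverse(x), inverse(y)), compose(x, y))


def psi(elements):
    """Evaluate the l-fold commutator map on a tuple of length 2**l."""
    if len(elements) == 1:
        return elements[0]
    half = len(elements) // 2
    return commutator(psi(elements[:half]), psi(elements[half:]))


def psi_image(group, level):
    """Image of psi_level on group^(2**level), built one commutator layer at a time."""
    image = set(group)
    for _ in range(level):
        image = {commutator(x, y) for x in image for y in image}
    return image


S4 = list(permutations(range(4)))      # toy 3-step solvable group
IDENTITY = tuple(range(4))
```

Here `psi_image(S4, 3)` is the trivial set while `psi_image(S4, 2)` is not, which is the toy analogue of the statement that the 3-fold commutator map vanishes on a 3-step solvable group.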

The following lemma is a very straightforward modification of
[2, Proposition 8].



^1 The ineffectivity would be of the following type: one could give an explicit expansion
constant $\kappa > 0$ which works for Sz(q), $q \geq q_0$, but no effective bound on $q_0$.




Lemma 4.1. Suppose that $S \subseteq W_k(L)$ is a set with the property that

\[ \psi_l\big( (s_\omega)_{\omega \in \{0,1\}^l} \big) = \mathrm{id} \]

for all $2^l$-tuples $(s_\omega)_{\omega \in \{0,1\}^l} \in S^{2^l}$. Then $|S| \ll_l L^{2l}$.

We remark that we will only need this lemma in the case k = 2, l = 3,
though it is scarcely more difficult to establish the general case.

Proof. Write f(k, l, L) for the smallest function which works in the claimed
bound; thus our desire is to show that $f(k, l, L) \ll_l L^{2l}$. Suppose that S
is a set with the stated property. Then the set $S' := \{[s, s'] : s, s' \in S\}$ is
a subset of $W_k(4L)$ with the property that all the (l - 1)-fold commutator
maps are trivial on S'. Therefore we have

\[ |S'| \leq f(k, l-1, 4L). \]

By the pigeonhole principle it follows that there exists some $a \in S'$ and
$b \in S$ such that there is a set $S_0 \subseteq S$,

\[ |S_0| \geq |S| / f(k, l-1, 4L), \tag{4.1} \]

such that [b, s] = a for all $s \in S_0$.

Suppose that $s_1, s_2 \in S_0$. Then we have $s_1^{-1} b s_1 = s_2^{-1} b s_2$, which implies
that b commutes with $s_1 s_2^{-1}$. By standard facts about the free group this
implies that there is some word x and some integers m, n, $|m|, |n| \leq 2L$,
such that $b = x^m$ and $s_1 s_2^{-1} = x^n$. Since knowledge of $x^m$ and m uniquely
determines x (another standard fact about the free group), we see that,
assuming b is fixed, there are no more than $(4L + 1)^2 < (5L)^2$ possible
values for $s_1 s_2^{-1}$.

Double-counting pairs of elements of $S_0$, it follows that $|S_0|^2 \leq (5L)^2 |S_0|$.
Comparing this with (4.1) of course yields

\[ |S| \leq (5L)^2 f(k, l-1, 4L). \]

By the definition of f this means that

\[ f(k, l, L) \leq (5L)^2 f(k, l-1, 4L), \]

and hence

\[ f(k, l, L) \leq 5^{2l} L^2 (4L)^2 \cdots (4^{l-1}L)^2 \cdot f(k, 0, 4^l L) \leq 5^{2l} 4^{l^2} L^{2l} f(k, 0, 4^l L). \]

Since $\psi_0$ is the identity map, a set on which it is trivial is contained in {id},
and so $f(k, 0, 4^l L) \leq 1$. This concludes the proof. □

We now have enough tools to conclude the analysis of the algebraic case.
Select a random pair of elements $a, b \in \mathrm{Sz}(q)$. By Lemma 3.5 these are
free up to length $\kappa \log q$ with probability $1 - o_{q \to \infty}(1)$. Set $n_0 := \kappa \log q / 100$
(say), and suppose that

\[ \mu_{a,b}^{(n)}(H) \geq \eta \]

for some $n \geq n_0$ and for some 3-step solvable group H. Then by a simple
averaging argument there is some x such that

\[ \mu_{a,b}^{(n_0)}(xH) \geq \eta. \]

As in the argument at the start of the section, this implies that

\[ \mu_{a,b}^{(2n_0)}(H) \geq \eta^2. \]

Note that $\mu_{a,b}^{(2n_0)}$ is counting walks of length $2n_0$ rather than distinct words; for
example if $n_0 = 3$ then ab could be counted as $a b b^{-1} a a^{-1} b$. However, just
as in [2, p. 637], it follows from Kesten's celebrated thesis [16] that no more
than $(2\sqrt{3})^{2n_0}$ of the $4^{2n_0}$ walks of length $2n_0$ starting from the identity end
up at any given point x.
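The Kesten-type count quoted above can be checked directly for small lengths, since the Cayley graph of the free group on two generators is the 4-regular tree and the number of walks ending at a vertex depends only on its distance from the root. The sketch below is an illustrative dynamic programme; the function names are hypothetical, and the bound is tested in the form $(2\sqrt{3})^{2n} = 12^n$ to stay in integer arithmetic.

```python
def walks_by_distance(length):
    """counts[j] = number of walks of the given length from the root of the
    4-regular tree that end at distance j from the root."""
    counts = [1]
    for _ in range(length):
        new = [0] * (len(counts) + 1)
        for j, c in enumerate(counts):
            if j == 0:
                new[1] += 4 * c        # every step from the root moves away
            else:
                new[j - 1] += c        # one neighbour is closer to the root
                new[j + 1] += 3 * c    # three neighbours are further away
        counts = new
    return counts


def sphere_size(j):
    """Number of vertices at distance j from the root (reduced words of length j)."""
    return 1 if j == 0 else 4 * 3 ** (j - 1)


def max_walks_to_a_point(length):
    """Largest number of walks of the given length ending at any single vertex."""
    counts = walks_by_distance(length)
    return max(c // sphere_size(j) for j, c in enumerate(counts))
```

For walks of length 2 this gives 4 returns to the root against the bound 12, and the gap widens polynomially with the length.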

Now words in a, b of length up to $16(2n_0)$ behave freely, and so we can
bound $\mu_{a,b}^{(2n_0)}(H)$ above by $(\sqrt{3}/2)^{2n_0}$ times $|W_2(2n_0) \cap H|$. Since H is
3-step solvable, the 3-fold iterated commutator $\psi_3$ vanishes on $H^8$. Applying
Lemma 4.1 we thus obtain

\[ \mu_{a,b}^{(2n_0)}(H) \ll n_0^6 (\sqrt{3}/2)^{2n_0}. \]

Since $n_0 \gg \log q$, it follows that $\eta < q^{-\delta}$ for some suitably small absolute
constant $\delta$. This concludes the proof of the algebraic case.

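The arithmetic case. Recall that our aim is to show that

\[ \sup_{x \in G,\ \mathbb{F}_{q_0} \subset \mathbb{F}_q} \mu_{a,b}^{(n)}(x^{-1} \mathrm{Sz}(q_0) x) < q^{-\delta} \]

for all $n \sim C \log q$, for some choice of $a, b \in G = \mathrm{Sz}(q)$ and for some $C, \delta > 0$.
We will in fact establish the bound

\[ \mathbb{E}_{a,b \in \mathrm{Sz}(q)} \sup_{x \in G,\ \mathbb{F}_{q_0} \subset \mathbb{F}_q} \mu_{a,b}^{(n)}(x^{-1} \mathrm{Sz}(q_0) x) < q^{-\delta} \tag{4.2} \]

from which the stated result is immediate by Markov's inequality.
If $t \in \mathrm{SL}_4(q)$ is a matrix, write

\[ \lambda^4 + c_1(t)\lambda^3 + c_2(t)\lambda^2 + c_3(t)\lambda + 1 := \det(t + \lambda) \]

for the characteristic polynomial of t; thus, for example, $c_1(t) = \mathrm{tr}(t)$ and
$c_3(t) = \mathrm{tr}(t^{-1})$. Since the characteristic polynomial is invariant under
conjugation, for given a, b the supremum in (4.2) is bounded above by

\[ \mathbb{P}_w\Big( c_1(w(a,b)), c_2(w(a,b)), c_3(w(a,b)) \in \bigcup_{\mathbb{F}_{q_0} \subset \mathbb{F}_q} \mathbb{F}_{q_0} \Big), \]

where the probability is taken over all words w of length $n \sim C \log q$. We
conclude that

\[ \text{the left-hand side of (4.2)} \leq \sigma_1 + \sigma_2 \tag{4.3} \]

where

\[ \sigma_1 := \mathbb{E}_{a,b \in \mathrm{Sz}(q)} \sum_{i=1}^{3} \sum_{x \in \bigcup_{\mathbb{F}_{q_0} \subset \mathbb{F}_q} \mathbb{F}_{q_0},\ x \neq 0} \mathbb{P}_w( c_i(w(a,b)) = x ) \tag{4.4} \]

and

\[ \sigma_2 := \mathbb{E}_{a,b \in \mathrm{Sz}(q)} \mathbb{P}_w( c_1(w(a,b)) = c_2(w(a,b)) = c_3(w(a,b)) = 0 ). \tag{4.5} \]

To bound these quantities $\sigma_1$ and $\sigma_2$, we use the following simple lemma.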

Lemma 4.2 (Schwartz-Zippel type lemma). Suppose that $F = \mathbb{F}_q$ for some
$q = 2^{2n+1}$, $n \geq 1$, and that $P : F^k \times F^k \to F$ is a polynomial in 2k variables
of degree at most d in any one of them. Set $\theta := 2^{n+1}$. Then either
$P(x_1, \ldots, x_k, x_1^\theta, \ldots, x_k^\theta) = 0$ for all $x_1, \ldots, x_k \in F$, or else the probability
that $P(x_1, \ldots, x_k, x_1^\theta, \ldots, x_k^\theta) = 0$ for a random choice of $x_1, \ldots, x_k \in F$ is
$O(kdq^{-1/2})$.

Proof. Write f(q, k, d) for the least quantity that will work as a bound in
this lemma. Let $P : F^k \times F^k \to F$ be a polynomial such that
$P(x_1, \ldots, x_k, x_1^\theta, \ldots, x_k^\theta)$ does not vanish identically. Write

\[ P(x_1, \ldots, x_k, x_1^\theta, \ldots, x_k^\theta) = \sum_{j,j'} p_{j,j'}(x_2, \ldots, x_k, x_2^\theta, \ldots, x_k^\theta)\, x_1^{j} (x_1^\theta)^{j'}, \tag{4.6} \]

where the $p_{j,j'}$ are polynomials of degree at most d in each variable. Then
there exists some j, j' for which

\[ p_{j,j'}(x_2, \ldots, x_k, x_2^\theta, \ldots, x_k^\theta) \]

does not vanish identically. For a randomly selected 2k-tuple
$(x_1, \ldots, x_k, x_1^\theta, \ldots, x_k^\theta)$ we distinguish two cases: either

\[ p_{j,j'}(x_2, \ldots, x_k, x_2^\theta, \ldots, x_k^\theta) = 0, \]

or this is not so. The chance of the first case occurring is at most f(q, k-1, d).
In the second case, let us fix $x_2, \ldots, x_k, x_2^\theta, \ldots, x_k^\theta$ and count the number of
possibilities for $x_1, x_1^\theta$. In this case (4.6) becomes a nontrivial polynomial
equation of the form $p(x_1, x_1^\theta) = 0$. Recalling that $x_1^\theta = x_1^{2^{n+1}}$, we may
regard this as a polynomial equation of degree at most $d(2^{n+1} + 1)$ in $x_1$,
and therefore it has at most $d(2^{n+1} + 1)$ solutions. Putting these observations
together we obtain the inequality

\[ f(q, k, d) \leq f(q, k-1, d) + q^{-1} d (2^{n+1} + 1). \]

Iterating we obtain

\[ f(q, k, d) \leq k d q^{-1} (2^{n+1} + 1), \]

which implies the claimed bound, since $2^{n+1} = \sqrt{2q}$. □
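The one-variable step of the proof, namely that $p(x, x^\theta) = 0$ has at most $d(\theta + 1)$ solutions, can be observed concretely in the field with 32 elements (n = 2, $\theta = 8$). The sketch below is an illustration under assumptions: the field is realised as F_2[t]/(t^5 + t^2 + 1), polynomials are given as hypothetical coefficient dictionaries mapping exponent pairs (j, j') to field elements, and the two sample polynomials are arbitrary.

```python
def gf_mul(x, y, modulus, degree):
    """Multiply x and y in F_2[t]/(modulus); elements are bit masks below 2**degree."""
    result = 0
    while y:
        if y & 1:
            result ^= x
        y >>= 1
        x <<= 1
        if (x >> degree) & 1:
            x ^= modulus
    return result


def gf_pow(x, exponent, modulus, degree):
    result = 1
    while exponent:
        if exponent & 1:
            result = gf_mul(result, x, modulus, degree)
        x = gf_mul(x, x, modulus, degree)
        exponent >>= 1
    return result


DEGREE, MODULUS = 5, 0b100101          # assumed model of F_32: t^5 + t^2 + 1
Q, THETA = 2 ** DEGREE, 2 ** 3         # q = 2^(2n+1) and theta = 2^(n+1) with n = 2


def evaluate(coefficients, x):
    """Evaluate the sum of c * x^j * (x^theta)^j' over all ((j, j'), c) entries."""
    y = gf_pow(x, THETA, MODULUS, DEGREE)
    total = 0
    for (j, j_prime), c in coefficients.items():
        monomial = gf_mul(gf_pow(x, j, MODULUS, DEGREE),
                          gf_pow(y, j_prime, MODULUS, DEGREE), MODULUS, DEGREE)
        total ^= gf_mul(c, monomial, MODULUS, DEGREE)   # addition in F_32 is XOR
    return total


def count_zeros(coefficients):
    """Number of x in F_32 with p(x, x^theta) = 0."""
    return sum(1 for x in range(Q) if evaluate(coefficients, x) == 0)


# Two sample polynomials p(x, y): x*y + x + 1 (d = 1) and x^2*y^2 + x*y + 1 (d = 2).
P1 = {(1, 1): 1, (1, 0): 1, (0, 0): 1}
P2 = {(2, 2): 1, (1, 1): 1, (0, 0): 1}
```

After the substitution $y = x^8$ these become $x^9 + x + 1$ and $x^{18} + x^9 + 1$, so `count_zeros(P1)` is at most 9 and `count_zeros(P2)` is at most 18, in line with the bound $d(\theta + 1)$.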

Remark. By using Lemma B.1, which gives a bound of $2d^2$ instead of the
trivial bound of $d(2^{n+1} + 1)$ for the number of solutions to $p(x_1, x_1^\theta) = 0$, it
is possible to obtain the stronger bound $f(q, k, d) \leq 2kd^2/q$ in this lemma.
We will not require this improvement in this paper.

Using Lemma 4.2 we can bound the probability that $c_i$ concentrates at a
non-zero value.

Lemma 4.3 (Non-concentration away from zero). Suppose that w is a word
in the free group $F_2$ of length L, that $x \in \mathbb{F}_q \setminus \{0\}$, and that i = 1, 2 or 3.
Then if a, b are selected randomly from Sz(q) we have

\[ \mathbb{P}_{a,b}( c_i(w(a,b)) = x ) \ll q^{-1/2} L. \]

Proof. The reader may wish to recall the explicit definition of Sz(q) given
in Definition 3.1. The probability that either a or b lies in the subgroup B
is at most $O(q^{-2})$, which is acceptable. For all other a, b we may use the
parametrisation $U(\alpha, \beta)D(\gamma)TU(\alpha', \beta')$. To parametrise a and b we require
10 variables, which we denote by $x_1, \ldots, x_{10}$. Clearing denominators arising
from the appearance of $\gamma^{-1}$ and $\gamma^{-\theta}$ in $D(\gamma)$, we may rewrite the equation
$c_i(w(a,b)) = x$ in the form

\[ P(x_1, \ldots, x_{10}, x_1^\theta, \ldots, x_{10}^\theta) = 0, \]

where P is a polynomial in 20 variables, of degree O(L) in each of them (note
that every matrix in $\mathrm{Sp}_4(q)$ has determinant one, so there is no issue when
w contains inverses of a and b). This polynomial does not vanish identically,
since when a = b = T the characteristic polynomial of any word w(a, b) is
simply $\lambda^4 + 1$, so in this case $c_1(w(a,b)) = c_2(w(a,b)) = c_3(w(a,b)) = 0$.
We are assuming, however, that $x \neq 0$. The result now follows from Lemma
4.2. □



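An immediate corollary of Lemma 4.3 is a bound for the quantity $\sigma_1$ from
(4.4).

Corollary 4.4. We have $\sigma_1 \ll q^{-1/6} \log q$.

Proof. By Lemma 4.3 and the definition of $\sigma_1$ we have

\[ \sigma_1 \ll \Big| \bigcup_{\mathbb{F}_{q_0} \subset \mathbb{F}_q} \mathbb{F}_{q_0} \Big| \cdot q^{-1/2} L, \]

where $L \ll \log q$ is the length of the words w. Now simply observe that

\[ \Big| \bigcup_{\mathbb{F}_{q_0} \subset \mathbb{F}_q} \mathbb{F}_{q_0} \Big| \ll q^{1/3}, \]

this being a consequence of the fact that 2n + 1 is odd, and hence has no
proper factor larger than (2n + 1)/3. □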
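The size of the union of proper subfields is simple to compute exactly, since an element of the field with $2^m$ elements lies in the subfield with $2^d$ elements precisely when its degree over F_2 divides d. The sketch below counts elements by exact degree; the function name is hypothetical, and the bound is tested in the cubed form size^3 <= 8 * 2^m, that is size <= 2 q^{1/3}, to avoid floating point.

```python
def proper_subfield_union_size(m):
    """Number of elements of the field with 2**m elements that lie in a proper subfield."""
    exact = {}   # exact[e] = number of elements whose degree over F_2 is exactly e
    for e in range(1, m + 1):
        if m % e == 0:
            smaller = sum(count for f, count in exact.items() if e % f == 0)
            exact[e] = 2 ** e - smaller
    # Elements in a proper subfield are exactly those of degree a proper divisor of m.
    return sum(count for e, count in exact.items() if e < m)
```

For example m = 15 gives 38, the size of the union of the fields with 8 and 32 elements, whereas for even m the subfield of index 2 alone has $q^{1/2}$ elements, which is why the oddness of 2n + 1 matters.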

It remains to bound the quantity $\sigma_2$ from (4.5). Note that if $c_1(w(a,b)) =
c_2(w(a,b)) = c_3(w(a,b)) = 0$ then the characteristic polynomial of w(a, b) is
simply $\lambda^4 + 1$ and hence, by the Cayley-Hamilton theorem, $w(a,b)^4 = \mathrm{id}$.
Once again, the probability that either a or b lies in the subgroup $B \subset \mathrm{Sz}(q)$
is just $O(q^{-2})$. It therefore suffices to bound

\[ \mathbb{E}_{a,b \in \mathrm{Sz}(q)} \mathbb{P}_w( a, b \notin B,\ w(a,b)^4 = \mathrm{id} ). \]

The average over w involves words of length only O(log q). By Lemma 3.4
none of these words $w(a,b)^4$ (except the trivial word) vanishes identically
on $\mathrm{Sz}(q) \setminus B$.

Parametrise pairs of elements $a, b \in \mathrm{Sz}(q) \setminus B$ as before, using ten
parameters $x_1, \ldots, x_{10}$. The condition that $w(a,b)^4 = \mathrm{id}$ leads to sixteen
polynomial equations, each of degree O(log q) in every variable. By Jones'
result (and for q sufficiently large), at least one of these polynomials must
be nontrivial. Applying Lemma 4.2 we may conclude that

\[ \sigma_2 \ll q^{-1/2} \log q. \tag{4.7} \]

Combining this with Corollary 4.4 and recalling the definition of $\sigma_1$, $\sigma_2$
(cf. (4.3)) we obtain (4.2) as desired. This concludes our analysis of the
arithmetic case and hence the proof of Theorem 2.3.



5. Remarks on $\mathrm{SL}_2(q)$

All of our arguments go through (and are considerably simpler) for $\mathrm{SL}_2(q)$,
$q = p^n$, p prime, $n \geq 1$. It is, however, already known due to work of
Lubotzky [20] (referenced in [17]) that the groups $\mathrm{SL}_2(q)$ are uniformly
expanders (with varying p and n) for some rather explicit sets of three
generators and their inverses. The methods of this paper, then, provide an
alternate proof of Lubotzky's result. There are perhaps three good reasons
for wishing to have such an alternative proof. Firstly, we avoid the use of
any deep number-theoretic information. Secondly, we can get two generators
rather than three. Thirdly, the result itself is of paramount importance
in the proof of the main theorem of Kassabov-Lubotzky-Nikolov [17].

Everything goes through in much the same way as for the Suzuki groups.
The "Helfgott-type" result for $\mathrm{SL}_2(q)$ (analogous to Theorem 2.1) was
obtained by Helfgott himself [12] for q prime, and was subsequently generalized
to arbitrary q in Oren Dinai's thesis (see [7]). As for the proof of the
non-concentration estimate, the algebraic case involves, apart from some
subgroups of bounded order ($\leq 60$), only 2-step solvable groups rather than
3-step. We refer the reader to Dickson's book [6, XII, 260] for a careful
description of the subgroup structure of $\mathrm{SL}_2(q)$. Finally, there is a much
more elementary argument for the arithmetic case, which we now give.

Recall that the aim is to show that, for a proportion $1 - o_{q \to \infty}(1)$ of pairs
$a, b \in G$, we have

\[ \mu_{a,b}^{(n)}(x^{-1} \mathrm{SL}_2(q_0) x) < q^{-\delta} \tag{5.1} \]

for some $\delta > 0$ and for all $n \geq C \log q$, uniformly for all $x \in G$ and all
$q_0 = q^{1/r}$, r > 1. In $\mathrm{SL}_2$, the characteristic polynomial is determined by the
trace, so we can proceed exactly as in the last section with the trace in place
of the $c_i$'s, except now there is a somewhat easier way to obtain a bound

\[ \mathbb{P}_{a,b \in \mathrm{SL}_2(q)}( \mathrm{tr}(w(a,b)) = x ) \ll q^{-1+\varepsilon} \tag{5.2} \]

which suffices for our purposes in exactly the same way that Corollary 4.4
and the bound (4.7) did in the previous section. Indeed, suppose that

\[ a = \begin{pmatrix} t_1 & t_2 \\ t_3 & (1 + t_2 t_3)/t_1 \end{pmatrix}, \qquad b = \begin{pmatrix} u_1 & u_2 \\ u_3 & (1 + u_2 u_3)/u_1 \end{pmatrix} \]

with $t_1, u_1 \neq 0$. For a fixed word w of length O(log q), the condition
tr(w(a, b)) = x can be written (by clearing denominators $t_1, u_1$) as a
polynomial of degree O(log q) in the variables $t_j, u_j$. By Borel's theorem [1] (see
also [11]), the word map $w : \mathrm{SL}_2 \times \mathrm{SL}_2 \to \mathrm{SL}_2$ is a dominant map as soon
as w is non-trivial. This implies that our polynomial is not constant. Thus
it is not hard to see that the number of solutions to it is $O(q^5 \log q)$ by
a somewhat simpler, untwisted, version of the counting argument given in
Lemma 4.2 above. The number of pairs a, b in which the top left entry
of either a or b is zero is clearly $O(q^5)$, and this completes the proof of (5.2).
To get (5.1) it suffices to observe that the subset of elements in $\mathbb{F}_q$ which
lie in a proper subfield has size $O(\sqrt{q})$. This completes the argument.
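The counting at the end of this argument rests on the fact that matrices with non-zero top left entry are freely parametrised by three field elements, while those with zero top left entry form a set of lower order. The following sketch checks this by brute force for small prime fields only; restricting to q = p prime is an assumption made here for simplicity, and the function names are hypothetical. It relies on `pow(t, -1, p)` for modular inverses, which needs Python 3.8 or later.

```python
def sl2(p):
    """All 2x2 matrices (a, b, c, d) over the integers mod p with ad - bc = 1."""
    return {
        (a, b, c, d)
        for a in range(p) for b in range(p)
        for c in range(p) for d in range(p)
        if (a * d - b * c) % p == 1
    }


def parametrised(p):
    """Matrices (t1, t2; t3, (1 + t2*t3)/t1) with t1 non-zero, as in the text."""
    return {
        (t1, t2, t3, (1 + t2 * t3) * pow(t1, -1, p) % p)
        for t1 in range(1, p) for t2 in range(p) for t3 in range(p)
    }
```

For each prime p the parametrised set has p^2 (p - 1) elements and the remaining p (p - 1) matrices are exactly those with top left entry zero, so pairs (a, b) with a zero top left entry number O(p^5) out of roughly p^6.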

Appendix A. A result of G. A. Jones

In this appendix, we prove Lemmas 3.3 and 3.4 following G. Jones' paper
[15].

We keep the notation of Section 3. Let $q = 2^{2n+1}$ and let $k = \overline{\mathbb{F}_2}$ be
the algebraic closure of $\mathbb{F}_2$. We will view $\mathrm{Sp}_4(k)$ as a closed affine algebraic
subset of 4 × 4 matrices over k, i.e. as a subset of $k^{16}$. We view it as an
algebraic group endowed with the Zariski topology. Let f be a polynomial
in 16 variables over k. We begin by establishing the following lemma, which
forms the heart of Jones' argument.

Lemma A.1. There is a constant c > 0 such that if f has degree at most
$c\sqrt{q}$, and f vanishes identically on $\mathrm{Sz}(q) \setminus B$, then f vanishes identically on
$\mathrm{Sp}_4(k)$.

Proof. Suppose f has degree at most $M := cq^{1/2}$ in each of the 16 variables,
where c > 0 is a sufficiently small absolute constant. Recall the
parametrisation of the (lower triangular) Borel subgroup
$B_0$ of $\mathrm{Sp}_4(k)$ as $u(a, b, \alpha, \beta)d(c, \gamma)$. The set $B_0 T B_0$ can be parametrised by
matrices of the form $u(a, b, \alpha, \beta)d(c, \gamma)Tu(a', b', \alpha', \beta')$. It is the so-called
"big cell" of the Bruhat decomposition of $\mathrm{Sp}_4(k)$. It is Zariski-open and
hence Zariski-dense in $\mathrm{Sp}_4(k)$. Hence if f vanishes entirely on $B_0 T B_0$, it
must vanish entirely on $\mathrm{Sp}_4(k)$. After multiplying f by a suitable power of $\gamma c$
in order to clear denominators, we obtain a polynomial
$P = P(a, b, \alpha, \beta, c, \gamma, a', b', \alpha', \beta')$
of degree at most 2M in each of the ten variables, with the property that
P = 0 if and only if $\gamma c = 0$ or f = 0. In particular if P vanishes on $k^{10}$,
i.e. is the zero polynomial, then f vanishes identically on $\mathrm{Sp}_4(k)$.

Recall from Definition 3.1 that any element $g \in \mathrm{Sz}(q) \setminus B$ can be parametrised
as

\[ g = u(\alpha^\theta, \beta^\theta, \alpha, \beta)\, d(\gamma^\theta, \gamma)\, T\, u(\alpha'^\theta, \beta'^\theta, \alpha', \beta'). \]

Let Q be the polynomial in 5 variables defined as

\[ Q(\alpha, \beta, \gamma, \alpha', \beta') = P(\alpha^\theta, \beta^\theta, \alpha, \beta, \gamma^\theta, \gamma, \alpha'^\theta, \beta'^\theta, \alpha', \beta'). \]

Since (by assumption) f vanishes on $\mathrm{Sz}(q) \setminus B$, Q must vanish identically
for all values of $\alpha, \beta, \gamma, \alpha', \beta'$ taken in $\mathbb{F}_q$. We shall prove that, due to the
upper bound on M, this forces the polynomial $P(a, b, \alpha, \beta, c, \gamma, a', b', \alpha', \beta')$
to be the zero polynomial and that will complete the argument.

Note that Q has degree at most $2M(\theta + 1)$ in each of its 5 variables. Since
$10M(\theta + 1) < q$, Q must take a non-zero value on $\mathbb{F}_q^5$ unless Q is formally
zero. This follows from the well-known fact that any non-zero polynomial in
k variables over $\mathbb{F}_q$ of degree at most d in each variable has at most $kdq^{k-1}$
solutions in $\mathbb{F}_q^k$, by the Schwartz-Zippel lemma (which one can prove by an
induction on dimension argument similar to that used to prove Lemma 4.2).
Therefore Q is formally zero.

Suppose now that P is formally non-trivial. Then there must be two
distinct monomials $m_1 = a^{m_a^1} b^{m_b^1} \alpha^{m_\alpha^1} \cdots$ and $m_2 = a^{m_a^2} b^{m_b^2} \alpha^{m_\alpha^2} \cdots$
appearing in P which coincide after the substitution $a \mapsto \alpha^\theta$, $b \mapsto \beta^\theta$,
$c \mapsto \gamma^\theta$. This means that $\theta m_a^1 + m_\alpha^1 = \theta m_a^2 + m_\alpha^2$ and similarly for b and c.
However, since $2M < \theta$, this forces $m_a^1 = m_a^2$ and $m_\alpha^1 = m_\alpha^2$ and similarly
for b, c, $\beta$ and $\gamma$, that is to say $m_1 = m_2$. This contradiction implies that P
is formally trivial and this ends the proof of the lemma. □

We are now in a position to deduce Lemma 3.3, whose statement we recall
now.

Lemma 3.3. The Suzuki group Sz(q) is not contained in any proper
algebraic subgroup of $\mathrm{Sp}_4(k)$ of complexity $M_q$, where $M_q \to \infty$ as $q \to \infty$.

Proof. Let V be a proper closed subvariety of $\mathrm{Sp}_4(k)$ of complexity at most
M containing Sz(q). Then by the definition of complexity (cf. [3, Section
3]), there is a polynomial f on 16 variables over k and of degree at most
M, which does not vanish identically on $\mathrm{Sp}_4(k)$ yet is identically zero on
V. We are thus in a position to apply the above lemma and conclude that
$M \geq cq^{1/2}$. □

Our other business is to establish Lemma 3.4.

Lemma 3.4. Suppose that w is some word in the free group $F_2$, and that
w(a, b) = id identically on $\mathrm{Sz}(q) \setminus B$. Then w has length at least $c\sqrt{q}$ for
some absolute constant c > 0.

Proof. Suppose not. The relation w(a, b) = id can be written as 16
polynomial equations of degree $\ll q^{1/2}$ in the 16 matrix coordinates. Applying
Lemma A.1 to each of them, we conclude that w(a, b) = id for every a and b
in $\mathrm{Sp}_4(k)$. We then observe that $\mathrm{Sp}_4(k)$ contains a closed algebraic subgroup
isomorphic to $\mathrm{SL}_2(k)$ (for example the subgroup of $\mathrm{Sp}_4$ fixing the vectors $e_1$
and $e_4$). On the other hand it follows from Borel's result^2 [1] that w(a, b) - id
cannot vanish identically on $\mathrm{SL}_2(k) \times \mathrm{SL}_2(k)$ unless w is trivial. □

Appendix B. Random Cayley graphs of Sz(q) have large girth

Our aim in this appendix is to supply a self-contained proof of Lemma 3.5.
The argument is basically the same as that in [8], only we use elementary
estimates instead of the deep model-theoretic work of Hrushovski. We first
recall the statement of the lemma.

Lemma 3.5. Let G = Sz(q). There is an absolute constant $\kappa > 0$ such that,
with probability $1 - o_{q \to \infty}(1)$, a randomly chosen pair $a, b \in G$ will be such
that $w(a, b) \neq \mathrm{id}$ for all nontrivial words w in the free group $F_2$ with length
at most $\kappa \log q$.

Proof. Let $\kappa > 0$ be a quantity to be specified later. The number of
nontrivial words $w \in F_2$ of length at most $\kappa \log q$ is
$\ll q^{\kappa \log 3}$. For each of them, let us estimate the probability
that a random pair of elements $a, b \in \mathrm{Sz}(q)$ satisfies
$w(a,b) = \mathrm{id}$. The probability that either $a$ or $b$ lies in the
subgroup $B$ (cf. Definition 3.1) is $O(q^{-2})$ and will be ignored. The
other elements may be parametrised as
$U(\alpha,\beta)D(\gamma)TU(\alpha',\beta')$. Parametrising $a$ and $b$ by ten
variables $x_1,\dots,x_{10}$, the equation $w(a,b) = \mathrm{id}$ is
equivalent to sixteen polynomial equations

\[ P_{ij}(x_1,\dots,x_{10},x_1^\theta,\dots,x_{10}^\theta) = 0, \]

one for each matrix entry, where each $P_{ij}$ has degree $O(\log q)$ in each
variable (we have cleared the denominators $\gamma^{-1}$ and
$\gamma^{-1-\theta}$ appearing in the expression by multiplying by an
$O(\log q)$ power of $\gamma$ and $\gamma^\theta$ to get $P_{ij}$).

By Lemma 3.4 we do not have $w(a,b) = \mathrm{id}$ identically for
$a, b \in \mathrm{Sz}(q) \setminus B$, and so at least one of these polynomial
equations is nontrivial. By Lemma 4.2 it follows that

\[ \mathbb{P}_{a,b}(a, b \notin B,\ w(a,b) = \mathrm{id}) \ll q^{-1/2} \log q. \]

Summing over $w$, one sees that the probability that a randomly selected pair
$a, b$ will satisfy any nontrivial word of length at most $\kappa \log q$ is
bounded by $O(q^{\kappa \log 3 - 1/2} \log q)$. Choosing
$\kappa < 1/(2 \log 3)$, the result follows immediately. □
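
The word count used at the start of the proof can be checked by direct enumeration. The following Python sketch is illustrative only: the function name `count_reduced_words` and the convention that `A`, `B` stand for the inverses of the generators `a`, `b` are our own choices, not notation from the paper. It lists the nontrivial reduced words of length at most $L$ in $F_2$ and lets one confirm that there are $2(3^L - 1)$ of them; taking $L = \kappa \log q$ and using $3^{\kappa \log q} = q^{\kappa \log 3}$ gives the bound $\ll q^{\kappa \log 3}$.

```python
def count_reduced_words(max_length):
    """Count nontrivial reduced words of length <= max_length in the free group F_2.

    Letters: 'a', 'b' are the generators; 'A', 'B' denote their inverses
    (a labelling chosen for this sketch only).
    """
    inverse = {"a": "A", "A": "a", "b": "B", "B": "b"}
    frontier = [""]          # reduced words of the current length, starting with the empty word
    total = 0
    for _ in range(max_length):
        # Extend each word by any letter that does not cancel its last letter.
        frontier = [
            word + letter
            for word in frontier
            for letter in inverse
            if not word or inverse[word[-1]] != letter
        ]
        total += len(frontier)
    return total


def closed_form(max_length):
    """Closed form 4 + 4*3 + ... + 4*3^(L-1) = 2*(3^L - 1)."""
    return 2 * (3 ** max_length - 1)
```

Comparing `count_reduced_words(L)` with `closed_form(L)` for small `L` confirms the geometric growth: there are four choices for the first letter and three for each subsequent one.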

As we remarked in the overview, our lower bound for the girth is precisely
half that of [8]. The reason for this is the rather crude bound on the number
of solutions to $p(x, x^\theta) = 0$ that we employed during the proof of
Lemma 4.2. By employing the following lemma instead (with appropriate
modifications of Lemma 4.2) we may recover the bound of [8]. This lemma is
plausibly of independent interest.



¹ This particular fact may also be proven more elementarily by a ping-pong
argument, noting that $w$ would also have to vanish on
$\mathrm{SL}_2(k(t)) \times \mathrm{SL}_2(k(t))$, where $t$ is an
indeterminate. This avoids an appeal to the Tits Alternative.
Lemma B.1. Let $q = 2^{2n+1}$ and write $\theta := 2^{n+1}$. Let
$p(x,y) \in \mathbb{F}_q[x,y]$ be a nontrivial polynomial with degree at most
$d$ in each of its variables. Then the number of solutions to
$p(x, x^\theta) = 0$ with $x \in \mathbb{F}_q$ is at most $2d^2$.

Proof. Suppose that $p(x, x^\theta) = 0$. Raising to the power $\theta$ and
recalling that $x \mapsto x^\theta$ is an automorphism with
$(x^\theta)^\theta = x^2$, we obtain

\[ p^\theta(x^\theta, x^2) = 0. \tag{B.1} \]

Here, $p^\theta$ means the polynomial obtained from $p$ by raising each
coefficient to the power $\theta$. Motivated by this observation let us
consider the more general problem of bounding above the number of solutions to

\[ p^\theta(y, x^2) = p(x,y) = 0 \]

with $x, y \in \mathbb{F}_q$. Write $P(x,y) = p^\theta(y, x^2)$; note that the
total degree of $P$ is at most $2d$. Let $f(x,y)$ be the highest common factor
of $p$ and $P$ in $\overline{\mathbb{F}_q}[x,y]$. Multiplying by a suitable
unit, we may take $f(x,y)$ to lie in $\mathbb{F}_q[x,y]$: indeed any
$\mathrm{Gal}(\overline{\mathbb{F}_q}/\mathbb{F}_q)$-conjugate of $f(x,y)$ is
also a common factor of $p$ and $P$, and so every irreducible factor of
$f(x,y)$ comes together with all of its conjugates, and the product of those
is defined over $\mathbb{F}_q$. Write $p = f p_1$ and $P = f P_1$.

Write $d'$ for the total degree of $f$, and suppose that $d' < d$. By Bezout's
theorem, the number of solutions to $p_1(x,y) = P_1(x,y) = 0$ is at most
$(2d - d')(d - d')$. All other solutions to $p(x, x^\theta) = 0$ must also
satisfy $f(x, x^\theta) = 0$, and so we may proceed inductively to conclude
that the total number of $x \in \mathbb{F}_q$ with $p(x, x^\theta) = 0$ is at
most

\[ (2d - d')(d - d') + 2d'^2 \leq 2d^2, \]

as required.
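
For completeness, the final inequality can be verified by expanding the product. The short computation below is our own expansion rather than part of the original argument, and it uses only the standing assumption $0 \leq d' < d$.

```latex
\begin{align*}
(2d - d')(d - d') + 2d'^2
  &= 2d^2 - 3dd' + d'^2 + 2d'^2 \\
  &= 2d^2 - 3d'(d - d') \\
  &\leq 2d^2 \qquad \text{since } 0 \leq d' < d .
\end{align*}
```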

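If $d' = d$ then we must proceed differently. In this case $p(x,y)$ divides
$P(x,y)$, and so we may write

\[ P(x,y) = p^\theta(y, x^2) \equiv g(x,y)\,p(x,y) \]

for some polynomial $g(x,y) \in \mathbb{F}_q[x,y]$. Here, and henceforth, we
use $\equiv$ to denote equivalence of polynomials (and not just expressions
that are equal for all substitutions of variables from $\mathbb{F}_q$). Making
an obvious substitution, we have

\[ p^\theta(x^2, y^2) \equiv g(y, x^2)\,p(y, x^2). \]

Raising both sides to the power $\theta$ then yields

\[ p^{\theta^2}(x^{2\theta}, y^{2\theta}) \equiv g^\theta(y^\theta, x^{2\theta})\,p^\theta(y^\theta, x^{2\theta}) \]

and hence that

\[ p^{\theta^2}(x^2, y^2) \equiv g^\theta(y, x^2)\,p^\theta(y, x^2). \]

Since $c^{\theta^2} = c^{2q} = c^2$ for every $c \in \mathbb{F}_q$, the
polynomial $p^{\theta^2}$ is $p$ with each coefficient squared. It follows that

\[ p(x,y)^2 \equiv p^{\theta^2}(x^2, y^2) \equiv g^\theta(y, x^2)\,p^\theta(y, x^2) \equiv g^\theta(y, x^2)\,g(x,y)\,p(x,y), \]

and so

\[ p(x,y) \equiv g^\theta(y, x^2)\,g(x,y). \]

In particular $p$ is reducible over $\mathbb{F}_q$ and we may proceed by
induction on the total degree of $p$. □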
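
The smallest cases of the lemma can be checked by brute force. The Python sketch below is illustrative only and rests on several choices of our own: the helper names (`build_field`, `power`, `max_solutions_degree_one`), the encoding of field elements as bit masks, and the moduli $t^3 + t + 1$ and $t^5 + t^2 + 1$ used to realise $\mathbb{F}_8$ and $\mathbb{F}_{32}$. It also assumes that $p$ has total degree at most $d = 1$, which is how the degree is used in the proof. Under these assumptions it verifies the identity $(x^\theta)^\theta = x^2$ and computes the largest number of roots of $a + bx + cx^\theta$ over all nonzero $(a, b, c)$, which can then be compared with the bound $2d^2 = 2$.

```python
from itertools import product


def build_field(m, modulus):
    """Return the multiplication table of GF(2^m) = F_2[t]/(modulus).

    Elements are integers 0..2^m-1 (bit i is the coefficient of t^i);
    addition in the field is bitwise XOR. The modulus must be irreducible.
    """
    q = 1 << m

    def mul(a, b):
        result = 0
        while b:
            if b & 1:
                result ^= a
            b >>= 1
            a <<= 1
            if (a >> m) & 1:      # reduce as soon as the degree reaches m
                a ^= modulus
        return result

    return [[mul(a, b) for b in range(q)] for a in range(q)]


def power(table, x, e):
    """Compute x^e in the field by square-and-multiply."""
    result = 1
    while e:
        if e & 1:
            result = table[result][x]
        x = table[x][x]
        e >>= 1
    return result


def max_solutions_degree_one(n, modulus):
    """Max number of x in F_q with a + b*x + c*x^theta = 0 over all (a, b, c) != 0.

    Here q = 2^(2n+1) and theta = 2^(n+1), as in Lemma B.1.
    """
    m = 2 * n + 1
    q = 1 << m
    theta = 1 << (n + 1)
    table = build_field(m, modulus)
    twist = [power(table, x, theta) for x in range(q)]   # the map x -> x^theta

    # Sanity check of the identity (x^theta)^theta = x^2 used in the proof.
    assert all(twist[twist[x]] == table[x][x] for x in range(q))

    best = 0
    for a, b, c in product(range(q), repeat=3):
        if a == b == c == 0:
            continue                                     # p must be nontrivial
        count = sum(
            1 for x in range(q)
            if (a ^ table[b][x] ^ table[c][twist[x]]) == 0
        )
        best = max(best, count)
    return best
```

For instance, `max_solutions_degree_one(1, 0b1011)` treats $q = 8$, $\theta = 4$, and `max_solutions_degree_one(2, 0b100101)` treats $q = 32$, $\theta = 8$. The search is exhaustive, so it is only practical for very small $q$ and $d$.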

The second author would like to thank Michael Larsen for helpful conver- 
sations in connection with this lemma. 